On the transferability of folding and threading potentials and sequence-independent filters for protein folding simulations
نویسندگان
چکیده
Significant progress has recently been made in de novo protein structure prediction. The Rosetta method by Baker and colleagues, which is based on the idea of assembling putative models from a library of k-mer fragments derived from known three-dimensional protein structures, proved to be particularly successful. Critical components of the Rosetta approach are various sequence-dependent as well as sequence-independent measures that are used to rank alternative models and to enhance sampling of native-like conformations. In the present work we revisit several sequence-independent filters that have been used to enhance the discrimination of native and native-like structures from misfolded structures, such as the overall compactness of the structure and its contact order. We also propose a novel sequenceindependent filter, based on the shape of the mean inter-residue radial distribution function. Using the Rosetta, Park–Levitt and CASP4 sets of decoys it is shown that sequenceindependent filters are in fact more successful in distinguishing native structures in Rosetta and CASP4 tests than commonly used knowledge-based pairwise potentials. The latter are typically designed to distinguish native structures in a population of well-folded alternatives, and they fail to discriminate between native-like and non-physically packed misfolded structures from Rosetta simulations. Moreover, a rigorous attempt to optimize pairwise potentials for recognition of homologous structures in threading by using a linear programming approach leads to further deterioration of performance in terms of recognition of native structures from the Rosetta set. These findings shed light onto the success of tailored scoring functions used in the Rosetta protocol and provide support for explicit inclusion of both sequence dependent and sequence independent measures in the design of scoring functions. A Web server that enables ranking of decoy structures according to sequence independent filters considered here is available at http://sift.chmcc.org.
منابع مشابه
Structural Characteristics of Stable Folding Intermediates of Yeast Iso-1-Cytochrome-c
Cytochrome-c (cyt-c) is an electron transport protein, and it is present throughout the evolution. More than 280 sequences have been reported in the protein sequence database (www.uniprot.org). Though sequentially diverse, cyt-c has essentially retained its tertiary structure or fold. Thus a vast data set of varied sequences with retention of similar structure and fun...
متن کاملEffects of Dimethyl Sulfoxide and Mutations on the Folding of Abeta(25-35) Peptide: Molecular Dynamics Simulations
The 25-35 fragment of the amyloid β (Aβ) peptide is a naturally occurring proteolytic by-product of its larger parent molecule that retains the amyloid characteristics and toxicity of the full length parent molecule. Aggregation of this peptide occurs rapidly in aqueous solutions and thus characterization of its folding process is very difficult. In the present study, early stages of Aβ(25–35) ...
متن کاملEnergy Study at Different Temperatures for Active Site of Azurin in Water, Ethanol, Methanol and Gas Phase by Monte Carlo Simulations
The interaction between the solute and the solsent molecules play a crucial role in understanding the various molecular processes involved in chemistry and biochemistry, so in this work the potential energy of active site of azurin have been calculated in solvent by the Monte Carlo simulation. In this paper we present quantitative results of Monte Carlo calculations of potential energies of ...
متن کاملProtein Stability, Folding, Disaggregation and Etiology of Conformational Malfunctions
Estimation of protein stability is important for many reasons: first providing an understanding of the basic thermodynamics of the process of folding, protein engineering, and protein stability plays important role in biotechnology especially in food and protein drug design. Today, proteins are used in many branches, including industrial processes, pharmaceutical industry, and medical fields. A...
متن کاملThe protein threading problem with sequence amino acid interaction preferences is NP-complete.
In recent protein structure prediction research there has been a great deal of interest in using amino acid interaction preferences (e.g. contact potentials or potentials of mean force) to align ('thread') a protein sequence to a known structural motif. An important open question is whether a polynomial time algorithm for finding the globally optimal threading is possible. We identify the two c...
متن کامل